
(1) GENERAL INFORMATION: 

(i) APPLICANT: Gorman, Cornelia M., 
Groskreutz, Debyra J. 

(ii) TITLE OF INVENTION: Prohormone Convertase Transformed Cells and 

Polypeptide Synthesis ■ >v 



(ill) NUMBER OF SEQUENCES: 57 




(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: r.nentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

■ (A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 
(B) COMPUTER: IBM PC compatible 
■(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(b) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA:. 

(A) AP&LICATION NUMBER: 08/026143 

(B) - FILING DATE: Ol-Mar-1993 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/887265 

(B) FILING DATE: 22-MAY-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/803631 

(B) FILING DATE: 06-DEC-1992 




(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US92/10621 

(B) FILING DATE: 04-DEC-1992 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Love, Richard B. 

(B) REGISTRATION NUMBER: 34,659 

(C) REFERENCE/DOCKET NUMBER: P0748P3 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2355 base pairs 



(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TCTAGATCTA GCTGGTGTGT CTCTGATCTT GCTTCTTTTC TCCCAGCCCT 50 
TCCTACTTGT GTGAGAACAA GGTTTTGAGC CATGGAGCAA AGAGGTTGGA 100 
CTCTGCAGTG TACTGCTTTC GCCTTCTTTT GCGTTTGGTG TGCACTAAGC 150 
AGTGTAAAAG CAAAGAGGCA GTTTGTTAAT GAATGGGCGG CGGAGATCCC 200 
CGGAGGGCAA GAAGCTGCCT CTGCCATCGC CGAAGAACTG GGGTATGACC 250 
TTTTGGGTCA GATTGGATCA CTTGAAAATC ACTATTTATT CAAACACAAA 300 
AGCCATCCTC GGAGGTCCCG AAGAAGCGCT CTTCATATCA CTAAGAGGTT 350 
ATCTGATGAT GATCGTGTGA CGTGGGCTGA ACAACAGTAT GAAAAAGAGA 400 
GAAGTAAACG TTCAGTTCAA AAAGACTCAG CATTGGATCT CTTCAATGAT 450 
CCAATGTGGA ATCAGCAGTG GTACTTGCAA GATACCAGAA TGACTGCAGC 500 
TCTGCCCAAG CTGGACCTTC ATGTAATACC TGTTTGGGAA AAGGGTATTA 550 
CTGGCAAAGG AGTTGTTATT ACTGTACTGG ATGATGGCTT GGAGTGGAAT 600 
CACACAGACA TTTATGCCAA TTATGATCCA GAGGCTAGCT ATGATTTTAA 650 
CGATAATGAT CATGATCCAT TTCCCCGATA TGATCTCACA AATGAAAACA 7 00 
AACATGGAAC AAGATGTGCA GGTGAAATTG CCATGCAAGC AAATAATCAC 750 
AAGTGTGGGG TTGGAGTTGC ATATAATTCC AAAGTTGGAG GCATAAGAAT 800 
GCTGGATGGC ATTGTAACTG ATGCCATTGA GGCTAGTTCA ATTGGATTCA 850 
'ACCCTGGCCA TGTGGATATT TACAGTGCAA GCTGGGGCCC TAATGATGAT 900 
GGAAAAACTG TGGAGGGGCC TGGCAGACTA GCCCAGAAGG CATTTGAATA 950 
TGGTGTCAAA CAGGGGAGAC AAGGGAAAGG CTCCATCTTT GTCTGGGCTT 1000 
CAGGGAATGG GGGTCGTCAG GGAGATAACT GTGACTGTGA TGGCTACACA 1050 
GACAGCATTT ACACCATCTC TATCAGCAGT GCCTCCCAGC AAGGCCTGTC 1100 
ACCTTGGTAT GCAGAGAAGT GTTCTTCCAC ATTGGCTACC TCCTACAGCA 1150 
GTGGTGATTA CACAGACCAG CGAATAACAA GCGCTGACCT GCACAATGAC 1200 
TGCACAGAGA CCCACACAGG CACCTCGGCT TCAGCACCCC TGGCTGCTGG 12 50 




TATCTTTGCT CTGGCCTTGG AGGCAAACCC AAATCTTACC TGGAGAGATA 1300 
TGCAGCATCT GGTTGTCTGG ACCTCTGAGT ACGACCCATT GGCCAGTAAC 1350 
CCAGGTTGGA AAAAGAATGG GGCAGGCTTG ATGGTGAACA GCCGATTTGG 1400 
ATTTGGCTTG CTAAATGCCA AAGCTCTGGT GGATTTGGCT GATCCTCGGA 1450 
CCTGGAGAAA TGTGCCTGAG AAGAAAGAAT GTGTTGTAAA AGACAATAAC 1500 
TTTGAGCCTA GAGCCCTGAA AGCTAATGGA GAAGTAATTG TTGAAATCCC 1550 
AACAAGAGCT TGTGAAGGAC AAGAAAATGC TATCAAGTCT CTGGAACATG 1600 
TGCAATTTGA AGCAACAATT GAATATTCTC GTAGAGGAGA CCTTCATGTC 1650 
ACACTCACTT CTGCTGTTGG AACCAGCACT GTACTGTTGG CTGAAAGGGA 17 00 
AAGAGATACA TCCCCCAATG GCTTTAAGAA TTGGGACTTC ATGTCTGTTC 1750 
ATACATGGGG AGAGAATCCT GTAGGCACCT GGACATTGAA AATTACAGAC 1800 
AtGTCTGGAA GAATGCAAAA TGSAGGAAGG ATTGTGAACT GGAAGTTGAT 1850 
TTTGCATGGG ACATCTTCTC AACCAGAGCA CATGAAGCAG CCCCGTGTGT 190u 
ACACATCCTA CAATACAGTC CAGAATGACA GGAGAGGAGT GGAAAAGATG 1950 
GTGAATGTTG TGGAGAAGCG GCCCACACAA AAGAGCCTGA ATGGCAATCT 2000 
CCTGGTACCC AAAAACTCCA GCAGCAGCAA TGTGGAGGGT AGAAGGGATG 2050 
AGCAGGTACA AGGAACTCCT TCAAAGGCCA TGCTGCGACT CCTACAAAGT 2100 
GCTTTTAGCA AGAATGCACT TTCAAAACAA TCACCAAAGA AGTCTCCAAG 2150 
TGCAAAGCTC AGCATCCCTT ATGAAAGTTT CTATGAAGCC TTGGAAAAGC 2200 
TTAACAAGCC CTCCAAGCTT GAAGGCTCTG AAGACAGTCT GTACAGTGAC 2250 
TATGTTGATG TATTCTATAA CACAAAACCT TATAAGCATA GAGATGACAG 2300 
GCTGCTGCAA GCTCTCATGG ACATCCTAAA TGAGGAGAAT TAAAATAAGG 2350 
AGCTC 2355 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2012 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



TCTAGATGCA TCTTCCCTCT TCGTCCCCTG CTCCACCACC CTGCGCGCCT 50 
CACAGCCCCG CTTTTCACTC CCAAAGAAGG ATGGAGGGCG GTTGTGGATC 100 
CCAGTGGAAG GCGGCCGGGT TCCTCTTCTG TGTGATGGTT TTTGCGTCTG 150 
CCGAGAGACC CGTCTTCACG AATCATTTTC TTGTGGAGTT GCATAAAGAC 200 
GGAGAGGAAG AGGCTCGCCA AGTTGCAGCA GAACACGGCT TTGGAGTCCG 250 
AAAGCTCCCC TTTGCAGAAG GCCTGTATCA CTTTTATCAC AATGGGCTTG 300 
CAAAGGCCAA AAGAAGACGC AGCCTACACC ATAAGCGGCA GCTAGAGAGA 350 
GACCCCAGGA TAAAGATGGC GCTGCAACAA GAAGGATTTG ACCGTAAAAA 400 
GAGAGGGTAC AGGGACATCA ATGAGATTGA CATCAACATG AATGATCCTC 450 
TCTTTACAAA GCAATGGTAC CTGTTCAACA CTGGGCAAGC CGATGGAACT 500 
CCTGGGCTAG ACTTGAACGT GGCCGAAGCC TGGGAGCTGG GATACACAGG 550 
AAAAGGAGTG ACCATTGGAA TTATGGATGA TGGAATTGAC TATCTCCACC 600 
CAGACCTGGC CTACAACTAC AACGCTGATG CAAGTTATGA CTTCAGCAGC 650 
AATGACCCCT ACCCATACCC TCGATACACA GATGACTGGT TCAACAGCCA 700 
TGGAACTAGG TGTGCAGGAG AAGTTTCTGC TGCAGCCAGC AACAATATCT 750 
GTGGAGTCGG CGTAGCATAC AACTCCAAGG TGGCAGGGAT CCGGATGCTG 800 , 
GACCAGCCCT TTATGACAGA CATCATCGAA GCCTCCTCCA TCAGCCACAT 850 
GCCTCAACTG ATCGACATCT ACAGTGCAAG CTGGGGCCCC ACAGACAATG 900 
GGAAGACGGT TGATGGGCCC CGAGAGCTCA CACTCCAGGC CATGGCTGAT 950 
GGCGTGAACA AGGGCCGTGG GGGCAAAGGC AGCATCTATG TGTGGGCCTC 1000 
TGGGGACGGT GGCAGCTACG ATGACTGCAA CTGTGACGGC TATGCTTCAA 1050 
GCATGTGGAC CATCTCCATC AACTCAGCCA TCAATGATGG CAGGACTGCC 1100 
TTGTATGATG AGAGTTGCTC TTCCACCTTA GCATCCACCT TCAGCAATGG 1150 
GAGGAAGAGG AATCCTGAGG CTGGTGTGGC TACCACAGAC TTGTATGGCA 1200 
ACTGTACTCT GAGACACTCT GGGACATCTG CAGCTGCTCC GGAGGCAGCT 12 50 
GGCGTGTTTG CATTAGCTTT GGAGGCTAAC CTGGATCTGA CCTGGAGAGA 1300 
CATGCAACAT CTGACTGTGC TCACCTCCAA GCGGAACCAG CTTCATGATG 1350 
AGGTTCATCA GTGGCGACGG AATGGGGTTG GCCTGGAATT TAATCACCTC 14 00 
TTTGGCTACG GAGTCCTTGA TGCAGGTGCC ATGGTGAAAA TGGCTAAAGA 14 50 



CTGGAAAACT GTTCCGGAGA GATTCCATTG TGTGGGAGGC TCTGTGCAGA 1500 
ACCCTGAAAA AATACCACCC ACCGGCAAGC TGGTACTGAC CCTCAAAACA 1550 
AATGCATGTG AGGGGAAGGA AAACTTCGTC CGCTACCTGG AGCACGTCCA 1600 
AGCTGTCATC ACAGTCAACG CGACCAGGAG AGGAGACCTG AACATCAACA 1650 
TGACCTCCCC AATGGGCACC AAGTCCATTT TGCTGAGCCG GCGTCCCAGA 1700 
GACGACGACT CCAAGGTGGG CTTTGACAAG TGGCCTTTCA TGACCACCCA 1750 
CACCTGGGGG GAGGATGCCC GAGGGACCTG GACCCTGGAG CTGGGGTTTG 1800 
TGGGCAGTGC ACCACAGAAG GGGTTGCTGA AGGAATGGAC CCTGATGCTT 1850 
CACGGCACAC AGAGCGCCCC ATACATCGAT CAGGTGGTGA GGGATTACCA 1900 
GTCTAAGCTG GCCATGTCCA AGAAGCAGGA GCTGGAGGAA GAGCTGGATG 1950 
AGGCTGTGGA GAGAAGCCTG CAAAGTATCC TGAGAAAGAA CTAGGGCCAC 2000 
GCTTCCGAAT TC 2012 
(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 amino acids ■ r.;;; 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: . Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Glu Gin Arg Gly Trp Thr Leu Gin Cys Thr Ala Phe Ala Phe 
15 10 15 

Phe Cys Val Trp Cys Ala Leu Ser Ser Val Lys Ala Lys Arg Gin 
20 25 30 

Phe Val Asn Glu Trp Ala Ala Glu He Pro Gly Gly Gin Glu Ala 
35 40 45 

Ala Ser Ala He Ala Glu Glu Leu Gly Tyr Asp Leu Leu Gly Gin 
50 55 60 

He Gly Ser Leu Glu Asn His Tyr Leu Phe Lys His Lys Ser His 
65 70 75 

Pro Arg Arg Ser Arg Arg Ser Ala Leu His He Thr Lys Arg Leu 
80 85 90 

Ser Asp Asp Asp Arg Val Thr Trp Ala Glu Gin Gin Tyr Glu Lys 

95 100 105 

Glu Arg Ser Lys Arg Ser Val Gin Lys Asp Ser Ala Leu Asp Leu 
110 115 120 



Phe Asn Asp Pro Met Trp Asn Gin Gin Trp Tyr Leu Gin Asp Thr 
125 130 135 



Arg Met Thr Ala Ala Leu Pro Lys Leu Asp Leu His Val lie Pro 
140 145 150 

Val Trp Glu Lys Gly He Thr Gly Lys Gly Val Val He Thr Val 
155 160 165 

Leu Asp Asp Gly Leu Glu Trp Asn His Thr Asp He Tyr Ala Asn 
170 175 180 

Tyr Asp Pro Glu Ala Ser Tyr Asp Phe Asn Asp Asn Asp His Asp 
185 190 195 



Pro Phe Pro Arg Tyr Asp Leu Thr Asn Glu Asn Lys His Gly Thr 
200 205 210 

Arg Cys Ala Gly Glu He Ala Met Gin Ala Asn Asn His Lys Cys 
215 220 225 

Gly Val Gly Val Ala Tyr Asn Ser Lys Val Gly Gly He Arg Met 
230 235 240 

Leu Asp Gly He Val Thr Asp Ala He Glu Ala Ser Ser He Gly 
245 250 255 

Phe Asn Pro . Gly His Val Asp He Tyr. Ser Ala Ser Trp Gly Pro- ■ 
260 265 270 

Asn Asp Asp Gly Lys Thr Val Glu Gly Pro Gly Arg Leu Ala Gin 
275 280 285 



Lys Ala Phe Glu Tyr Gly Val Lys Gin Gly Arg Gin Gly Lys Gly 
290 295 300 

Ser He Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Gin Gly Asp 
305 310 315 

Asn Cys Asp Cys Asp Gly Tyr Thr Asp Ser He Tyr Thr He Ser 
320 325 330 

He Ser Ser Ala Ser Gin Gin Gly Leu Ser Pro Trp Tyr Ala Glu 
335 340 345 

Lys Cys Ser Ser Thr Leu Ala Thr Ser Tyr Ser Ser Gly Asp Tyr 
350 355 360 

Thr Asp Gin Arg He Thr Ser Ala Asp Leu His Asn Asp Cys Thr 
365 370 375 

Glu Thr His Thr Gly Thr Ser Ala Ser Ala Pro Leu Ala Ala Gly 
380 385 390 



He Phe Ala Leu Ala Leu Glu Ala Asn Pro Asn Leu Thr Trp Arg 
395 400 405 



Asp Met Gin His Leu Val Val Trp Thr Ser Glu Tyr Asp Pro Leu 
410 415 420 



Ala Ser Asn Pro Gly Trp Lys Lys Asn Gly Ala Gly Leu Met Val 
425 430 435 

Asn Ser Arg Phe Gly Phe Gly Leu Leu Asn Ala Lys Ala Leu Val 
440 445 450 

Asp Leu Ala Asp Pro Arg Thr Trp Arg Asn Val Pro Glu Lys Lys 
455 460 465 

Glu Cys Val Val Lys Asp Asn Asn Phe Glu Pro Arg Ala Leu Lys 
470 475 480 

Ala Asn Gly Glu Val lie Val Glu He Pro Thr Arg Ala Cys Glu 
485 490 495 

Gly Gin Glu Asn Ala He Lys Ser Leu Glu His Val Gin Phe Glu 
500 505 510 

Ala Thr He Glu Tyr Ser Arg Arg Gly Asp Leu His Val Thr Leu 
515 520 525 

Thr Ser Ala Val Gly Thr Ser Thr Val Leu Leu Ala Glu Arg Glu 
530 535 540 

Arg Asp Thr Ser Pro Asn Gly Phe Lys Asn Trp Asp Phe Met Ser 
545' 550 555 

Val His Thr Trp Gly Glu Asn Pro Val Gly Thr Trp Thr Leu Lys 
560 565 570 

He Thr Asp Met Ser Gly Arg Met Gin Asn Glu Gly Arg He Val 
575 580 585 

Asn Trp Lys Leu He Leu His Gly Thr Ser Ser Gin Pro Glu His 
590 595 600 

Met Lys Gin Pro Arg Val Tyr Thr Ser Tyr Asn Thr Val Gin Asn 
605 610 615 

Asp Arg Arg Gly Val Glu Lys Met Val Asn Val Val Glu Lys Arg 
620 625 630 

Pro Thr Gin Lys Ser Leu Asn Gly Asn Leu Leu Val Pro Lys Asn 
635 640 645 

Ser Ser Ser Ser Asn Val Glu Gly Arg Arg Asp Glu Gin Val Gin 
650 655 660 



Gly Thr Pro Ser Lys Ala Met Leu Arg Leu Leu Gin Ser Ala Phe 
665 670 675 



Ser Lys Asn Ala Leu Ser Lys Gin Ser Pro Lys Lys Ser Pro Ser 
680 685 690 



Ala Lys Leu Ser lie Pro Tyr Glu Ser Phe Tyr Glu Ala Leu Glu 
695 700 705 



Lys Leu Asn Lys Pro Ser Lys Leu Glu Gly Ser Glu Asp Ser Leu 

710 715 720 

Tyr Ser Asp Tyr Val Asp Val Phe Tyr Asn Thr Lys Pro Tyr Lys 

725 730 735 

His Arg Asp Asp Arg Leu Leu Gin Ala Leu Met Asp lie Leu Asn 

740 745 750 



Glu Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Glu Gly Gly Cys Gly Ser Gin Trp Lys Ala Ala Gly Phe Leu 

15 10 15 

Phe Cys Val Met Val Phe Ala Ser Ala Glu Arg Pro Val Phe, Thr 

20 25 30 

Asn His Phe Leu Val Glu Leu His Lys Asp Gly Glu Glu Glu Ala 

35 40 45 

Arg Gin Val Ala Ala Glu His Gly Phe Gly Val Arg Lys Leu Pro 

50 55 60 

Phe Ala Glu Gly Leu Tyr His Phe Tyr His Asn Gly Leu Ala Lys 

65 70 75 



Ala Lys Arg Arg Arg Ser Leu His His Lys Arg Gin Leu Glu Arg 

80 85 90 

Asp Pro Arg He Lys Met Ala Leu Gin Gin Glu Gly Phe Asp Arg 

95 100 105 

Lys Lys Arg Gly Tyr Arg Asp He Asn Glu He Asp He Asn Met 

110 115 120 

Asn Asp Pro Leu Phe Thr Lys Gin Trp Tyr Leu Phe Asn Thr Gly 

125 130 135 

Gin Ala Asp Gly Thr Pro Gly Leu Asp Leu Asn Val Ala Glu Ala 

140 145 150 

Trp Glu Leu Gly Tyr Thr Gly Lys Gly Val Thr He Gly He Met 

155 160 165 



Asp Asp Gly lie Asp Tyr Leu His Pro Asp Leu Ala Tyr Asn Tyr 
170 175 180 

Asn Ala Asp Ala Ser Tyr Asp Phe Ser Ser Asn Asp Pro Tyr Pro 
185 190 195 

Tyr Pro Arg Tyr Thr Asp Asp Trp Phe Asn Ser His Gly Thr Arg 
200 205 210 

Cys Ala Gly Glu Val Ser Ala Ala Ala Ser Asn Asn He Cys Gly 
215 220 225 

Val Gly Val Ala Tyr Asn Ser Lys Val Ala Gly He Arg Met Leu 
230 235 240 



Asp Gin Pro Phe Met Inr Asp He He Glu Ala Ser Ser He Ser 
245 250 . 255 

• His M6t Pro Gin Leu He Asp He Tyr Ser Ala Ser Trp Gly Pro 
260 265 270 

Thr Asp Asn Gly Lys Thr Val Asp Gly Pro Arg Glu Leu Thr Leu 
275 280 285 

Gin Ala Met Ala Asp Gly Val Asn Lys Gly Arg Gly Gly Lys Gly ■ 
290 295 300 

Ser He Tyr Val Trp: Ala Ser Gly Asp Gly Gly Ser Tyr Asp Asp-w-^" 
305 310 315 

Cys Ash Cys Asp Gly Tyr Ala Ser Ser Met Trp Thr He Ser He 
320 325 330 

•Asn Ser Ala He Asn Asp Gly Arg Thr Ala Leu Tyr Asp Glu Ser 
335 340 345 

Cys Ser Ser Thr Leu Ala Ser Thr Phe Ser Asn Gly Arg Lys Arg 
350 355 360 

Asn Pro Glu Ala Gly Val Ala Thr Thr Asp Leu Tyr Gly Asn Cys 
365 370 375 

Thr Leu Arg His Ser Gly Thr Ser Ala Ala Ala Pro Glu Ala Ala 
380 385 390 

Gly Val Phe Ala Leu Ala Leu Glu Ala Asn Leu Asp Leu Thr Trp 
395 400 405 

Arg Asp Met Gin His Leu Thr Val Leu Thr Ser Lys Arg Asn Gin 
410 415 420 

Leu His Asp Glu Val His Gin Trp Arg Arg Asn Gly VaJ Gly Leu 
425 430 435 

Glu Phe Asn His Leu Phe Gly Tyr Gly Val Leu Asp Ala Gly Ala 
440 445 450 



Met Val Lys Met Ala Lys Asp Trp Lys Thr Val Pro Glu Arg Phe 

455 460 465 

His Cys Val Gly Gly Ser Val Gin Asn Pro Glu Lys He Pro Pro 

470 475 480 

Thr Gly Lys Leu Val Leu Thr Leu Lys Thr Asn Ala Cys Glu Gly 

485 490 495 



5. ,iiS.!l',tls!p!«-- 



Lys Glu Asn Phe Val Arg Tyr Leu Glu His Val Gin Ala Val He 
500 505 510 

Thr Val Asn Ala Thr Arg Arg Gly Asp Leu Asn He Asn Met Thr 
515 520 525 

Ser Pro Met Gly Thr Lys Ser He Leu Leu Ser Arg Arg Pro Arg 
530 535 540 

Asp Asp Asp Ser Lys Val Gly Phe Asp Lys Trp Pro Phe Met Thr 
545 550 555 

Thr His Thr Trp Gly Glu Asp Ala Arg Gly Thr Trp Thr Leu Glu 
560 565 570 

Leu Gly Phe Val Gly Ser Ala Pro Gin Lys Gly Leu Leu Lys Glu ■ 
575 580 585 

Trp Thr LeU Met Leu His Gly Thr Gin Ser Ala Pro Tyr He Asp ■ 

590 595 600 ' 

Gin Val Val Arg Asp Tyr Gin Ser Lys Leu Ala Met Ser Lys Lys 
605 610 615 




Gin Glu Leu Glu Glu Glu Leu Asp Glu Ala Val Glu Arg Ser Leu 
620 625 630 

Gin Ser He Leu Arg Lys Asn 
635 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GCAAAATCTA GAYKGCNATY GTNGAYGAKG GN 32 
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 



(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



AAGCATGAGC TCNGGRGCRG CRGCNGANCC 30 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



GATATCACTC AGATCGATGA ATTCGAGCTC 30 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



AAGCTTTCTA GAGGATCCCT CTGGTGGATT TGG 33 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



AAGCTTGAAT TCTCCAACCC CACACTTGTG 30 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



AGATCGATGA ATTCGAGCTC 20 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



CATTCTCGAA AAAAGAGACA A 21 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



CATTCTGTAA AAAAGAGACA A 21 
(2) . INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



CATTCTAGAG CAAAGAGACA A 21 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



CATTCTAGAA AAGCAAGACA A 21 
(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 



CATTCTAGAA AAAGAGCACA A 21 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



ACCTGGAGCA AAGCTTCTCT G 21 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



ACCTGGAGCG CTAGGTCTCT G 21 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CATAAGCTTA CCATGGCCCT GTGGATGCGC 30 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 



(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



CATTCTAGAC TAGTTGCAGT AGTTCTCCAG 30 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Lys Thr Arg Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Leu Gin Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Arg Thr Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Arg Gin Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Lys Thr Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 



CTCTGCCTCC CGCTTGGTCC TGGGTGTGTA G 31 

(2)- INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
• :, (B) TYPE: Nacleic Acid 

; ;,(C) STRANDEDNESS: Single 
'.'(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26 



CACGCTTCTG CCGGGATCCC TC 22 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 



CTCTGCCTCC CGCTTGGTCT TGGGTGTGTA G 31 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



GATATGAAGA GCAGATCTTT TGGACCTCCG AGGATG 36 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



CTTATGGTGT AAGCTTCGTT TTGCTCTGGC CTTTGCAAG 3 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) . TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



TACAACTCAC CGCGGGTCCT G 21 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



AAGATGGGAT GGGATGATGA CCGTTTCCGC CTTGATGT 38 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



ACATCACGGC GGAAACGGTC ATCATCCCAT CCCATCTT 38 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



GATATAAGCT TGAGAGTGTA GAAGGGGC 28 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 oase pairs 

(B) TYPE: Nucleic. Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

; (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 



GGCTTGACAT GATTGGCTGA CACTTTCGAA CACATGATAG 

(2). INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
■ . (A)" LENGTH: 41 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 



AAGATGGGAT GGGATGATGA GCGCCGGACC CTCATGGACA 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



ATGTCCATGA GGGTCCGGCG CTCATCATCC CATCCCATCT T 
(2) INFORMATION FOR SEQ ID NO: 37: 



0 




41 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: Nucleic Acid 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37; 



GATATAAGCT TGAGAGTGTA GAAGGGGC 28 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TGAGCGACAG CACCCCCTTG GAGCCCCCGC CCTTGTATCT CATGGAGGAT 50 
T 51 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
■ (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single- ■ 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

TCAGCTCCCC TCGTCGGGCG GGGTCCGAGT GCCGTTTCCG CCGTGATGTT 50 
C 51 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



ACGTGGGCAG CCCCGTGGTG GCGAACAGAA CATCACGGCG GAAACGGC 4 8 
(2) INFORMATION FOR SEQ ID NO: 41: 

(iy SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 



(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



TGTTCGCCAC CACGGGGCTG CCCACGTAAT CCTCCATGAG ATACAAGG 4 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



CTCGGACCCC GCCCGACGAG GGGAGC 26 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



GCGGGGGCTC CAAGGGGGTG CTGTCGC 27 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



GTAGAACAAC ATGGACATGG TGGCAATATT GTCGACTCTG GAGTCGACCT 
GCAG 54 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CACATAAAAC AAGATGGACA TGGTCTTGTT CACCTGTAGG ATCCCCGG 4 8 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



AGTAAGGAAA AGGATGGTCA TGGTGGAGGT CGACAAGCTT GAGAATTCAA 50 
TCG 53 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

.. Xaa Xaa Xaa Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 48: 

('i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

His Ser Arg Lys Lys Arg Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO : 4 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

His Ser Val Lys Lys Arg Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

His Ser Arg Ala Lys Arg Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

His Ser Arg Lys Ala Arg Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

His Ser Arg Lys Arg Ala Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Thr Trp Ser Lys Ala Ser Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 




Thr Trp Ser Ala Arg Ser Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 



CATAAGCTTA CCATGGCCCT GTGGATGCGC 30 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) ■ SEQUENCE DESCRIPTION: SEQ ID NO: 56 

' ' - . 

CATTCTAGAC TAGTTGCAGT AGTTCTCCAG 30 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 



GCTTCCACCA GGTCGGATCC GCACAGGTG 29 



